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The impact of inhomogeneous arrangement of nodes in space on network organization cannot be 
neglected in most of real-world scale-free networks. Here, we propose a model for a geographical 
network with nodes embedded in a fractal space in which we can tune the network heterogeneity 
by varying the strength of the spatial embedding. When the nodes in such networks have power- 
law distributed intrinsic weights, the networks are scale-free with the degree distribution exponent 
decreasing with increasing fractal dimension if the spatial embedding is strong enough, while the 
weakly embedded networks are still scale-free but the degree exponent is equal to 7 = 2 regardless 
of the fractal dimension. We show that this phenomenon is related to the transition from a non- 
compact to compact phase of the network and that this transition accompanies a drastic change of 
the network efficiency. We test our analytically derived predictions on the real-world example of 
networks describing the soil porous architecture. 

PACS numbers: 



I. INTRODUCTION 

Scale-free organization of networks [H-[l| seems to be 
the underlying principle common of many complex sys- 
tems. In real- world networks, examples include the In- 
ternet, social networks or communication networks [U- 
9], the inhomogeneous arrangement of nodes in space 
strongly impacts the network organization and the link- 
ing rules must include the dependence on the distances 
between nodes [TohT^ . However, most of the network 
models studied so far considered either randomly dis- 
tributed nodes in metric space (l3 - fl7| or nodes placed 
on a lattice [l8r-22]. The importance of inhomogeneous 
spatial positions of nodes was emphasized in Q where it 
was shown that the fractality of the space is one of the 
universal parameters that constrain the Internet mod- 
els describing Internet's large-scale topology and its ob- 
served scale-free character [2a - l25j at the router and au- 
tonomous system level. While preferential attachment 
[U, HI seems to be the main underlying mechanism struc- 
turing the Internet, the original form of the preferential 
attachment Q should be altered [3, HI] to account 
for the observed spatial and/or functional heterogeneity 
of the nodes. 

Here, we study the structure of networks formed by ge- 
ographical network model in which the nodes with power- 
law distributed intrinsic weights (i.e., fitnesses [27H29| ) 
are embedded in a fractal space. We show analytically 
that the networks produced by such model are scale-free 
with the degree exponent influenced by the fractal di- 



mension of the embedding space if the spatial embedding 
is strong enough. By explicitly deriving the degree and 
the edge-length distribution functions, we classify these 
networks into non-compact phase with infinite average 
degree and average edge length and compact phase with 
finite average degree and average edge length, separated 
by an intermediate phase characterized by finite average 
degree and infinite average edge length. It is also shown 
that the transition between these phases accompanies a 
drastic change of the network efficiency. Finally, we use 
our findings in the analysis of the networks describing 
the soil porous architecture as an example. 



II. MODEL 

In order to describe an inhomogeneous distribution of 
N nodes in a space of linear dimension L, the nodes 
are isotropically distributed in a fractal manner with the 
fractal dimension D. In the theoretical treatment we 
employ an artificial boundary conditions for which every 
node position is statistically equivalent. This is slightly 
different from usual periodic boundary conditions leading 
to an anisotropy. The number of nodes is thus given by 
N = pfl l ^ 1 dl, where 51 is the D-dimensional solid 
angle and p is the density of nodes. Each node has a 
real and continuous fitness x randomly assigned accord- 
ing to the distribution function s(x). If a node pair (i-j) 
satisfies 
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then these two nodes are connected by an edge, where 
Uj is the Euclidean distance between the nodes i and 
j, m(> 0) is a real parameter quantifying the strength 
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of the spatial embedding (namely, the strength of the 
geographical effect), 6 is a threshold value, and F(x, y) is 
a function relating the two fitnesses to the connectivity. 
If m = 0, we have a conventional fitness model which 
provides a scale-free network for a variety of combinations 
of forms of F(x, y) and s(x) [13,[23|- Here, we concentrate 
on the case of 

F(x,y)=xy, (2) 

and 

s(x) = s Q x~ a , (3) 

where x (and y) are in the range of [a; m i n , oo) and a > 1. 
From the normalization condition of s(x), we have 

So = (a - ■ (4) 

Similar geographical network models have been studied 
under the assumption that nodes are homogeneously dis- 
tributed in a Euclidean space [ill [ItJ , while here we in- 
vestigate how the inhomogeneity (fractality) of spatial 
distribution of nodes affects properties of the network. 

III. DEGREE DISTRIBUTION FUNCTION 

First, we calculate the degree distribution function of 
the network formed by the above geographical algorithm. 



Let ki(l)dl be the number of nodes connected to the node 
i and included in a thin spherical shell of the radius I 
(width dl) centered at the position of the node i. Since 
the distance between the node i and a node in the shell 
is I, the connectivity condition Eq. (flj tells us that nodes 
with Xj > Ql m /xi can connect to the node i. Thus, the 
number of connected nodes ki(l)dl is 



ki(l)dl — n(l)dl ■ / s(x) dx , (5) 

J&l m /Xi 



where n{l)dl = pfll D ~ 1 dl is the number of nodes in 
this spherical shell. In this expression, we assume that 
Ql m /xj > x m - ln , which is equivalent to I > l m i n {xi) where 



W^) = (^) 1/m • (6) 



Nodes within the distance l m in(xi) from the node i can 
connect to the node i. Thus, using Eq. @, we have 
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Integrating ki(l) with respect to I over (0, L] we obtain 
the total number of nodes connected to the node i, or the 
degree fcj of the node i given by 



ki — 



i(a-l) 
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It should be noted that geometrical coefficients in two 
terms of Eq. |5]) coming from the volume integration are 
not very meaningful for realistic systems because of our 
artificial boundary conditions. In the case of D — m(a — 
1) > 0, the first term of Eq. © dominates the second 
term when L is sufficiently large and we have ki oc . 
The degree distribution function P{k) calculated from 
the relation \P(k)dk\ = \s(x)dx\ is then proportional to 
k~ 2 independently of m, a, and D. If however, D — 
m(a — 1) < 0, the second term of Eq. ([5]) becomes much 
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larger than the first term. The relation ki oc xf^ m in 
this case leads to P(k) oc fc-"»(a-i)/D+i. Therefore, the 
degree distribution function of the present geographical 
network model is given by 



P(k) oc 




m < TO c o 
m > m c o 



where 



D 



m c0 



1 



(9a) 
(9b) 
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The distribution function P(k) obeys power-law forms in 
both cases of m < m c0 and m > m c0 , and the degree 
exponent 7 is 




1+1 



m < m c o 
m > m c o 



(11) 
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The geographical inhomogeneity of the node distribution 
does not affect the degree distribution of the network for 
a weak geographical effect (small m) , while 7 depends on 
D for a strong effect (large to). 

In the above argument, we assumed that the system 
size L is always larger than l m i n (xi) for any Xi, because we 
are interested in the thermodynamic case (L — > 00). This 
is, however, not obvious, because Xi [and then Z m i n (xj)] 
can also diverge in the thermodynamic limit under a con- 
stant density p. Let us consider carefully this condition 
L > l m in(xi). In a finite system with N = p£lL D /D 
nodes, the fitness x is truncated at a finite value. The 
maximum fitness £ max is given by N J s(x)dx = 1. 
Using the fitness distribution Eq. (|3J) with Eq. (fj|, the 
quantity x max is given by 



7yl/(a-l) 



(12) 



Thus, the length l m i a (xi) can be as large as l m i n (x max ), 
where 



^min(-£max) 



(pn)V(«-l)Lf/(a-l) a 



0DV(a-l) 



l/n 
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From the above expression, the condition Zmm^max) < L 
is equivalent to > 0q, where 



©o 



pCl 
~D 



(14) 



If m > TO c o, the quantity ©o goes to zero in the thermo- 
dynamic limit and any finite satisfies > ©o, namely, 
^min(^max) < L. Thus, the degree distribution function 
P(k) is given by Eq. (|9bj) in the thermodynamic limit 
with to > TO c o. On the contrary, if m < m c o, is always 
less than ©o because 0o diverges as L — ► 00. In this case, 
there must be nodes satisfying l m in(xi) > L for which 
Xi > 0L m /ir min . Since the condition xi > 9L m /i min 
implies that any node in the whole system can connect to 
the node i, the degree of such a node is N— 1 independent 
of Xi. Thus, these nodes give an additional S- functional 
contribution [S(k — N + 1)] to the degree distribution 
P(k) given by Eq. (|9"aj) for nodes with xi < QL m /x 
Let us estimate the magnitude of this ^-functional con- 
tribution. It is proportional to the number of nodes no 
having Xi > QL m /x m - ln . The quantity no is given by 
N /e^m/j. s ( x ) dx and can be written as 



Since m(a — 1) is always positive, the (S-functional con- 
tribution vanishes in the thermodynamic limit. There- 
fore, Eq. is valid both for the cases of to < m c o and 
to > TO c o in an infinite system. For a finite L, however, 
© can be chosen to be less than ©o independently of to 
and the ^-functional contribution remains finite. Thus, 
the degree distribution function of a finite system with 
< ©o must have the ^-functional correction term, that 



P{k)= Po k- 



-m(cK— 1) 



<y(k-jv + i), (17) 



for both to < to c o and m > m C Q. Here, po is a normaliza- 
tion constant and the exponent 7 is given by Eq. (jlll) . It 
should be noted that Eq. (fT7|) holds for a finite but large 
L because the first term of Eq. (fTT)) is valid in the large 
L limit. 

The quantity ©o is a characteristic value of peculiar 
to a finite system with a fixed density p. There are two 
other characteristic values of for a finite L. One is m in 
below which every node can connect to all other N — 1 
nodes. Network constructed under < m j n becomes 
the complete graph. Obviously, m i n is given by 



©n 



(18) 



Another characteristic is ma x above which no node 
can connect to any other nodes. Network with > max 
is a set of isolated nodes. We have 



max 



(19) 



where = {pVL/D)- 1 ' is the minimum edge length. 
Using Eq. (Tl"2"j) . © ma x is written as 



©r 



(£)' 



t 1 T - 

■^min-^ 



(20) 



In a finite system we always assume that © satisfies the 
condition @ m i n «C © © ma x leading to non-trivial net- 
works. It is not necessary to consider this condition in 
min- an infinite system because © m i n and © max vanishes and 
diverges, respectively, in the thermodynamic limit. 



IV. RELATION BETWEEN (k) AND 6 



n = N 



— m{ot— 1) 



where £ is the node-pair distance defined by 



£ — ^min (^min) 



x 2 ■ 
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l/r, 



(15) 
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below which the two nodes are connected independently 
of the fitness. The properly normalized (^-functional part 
of P{k) is then presented by (L/0~ m(a_1) <5(fc -N + l). 



From Eq. ([5]), it is clear that the average degree (k) 
diverges for to < m c o because of 7 = 2 and it remains 
finite for to > m c o in the thermodynamic limit. The 
average degree (k) in a finite system is, however, always 
finite and depends on 0. A large restricts a connection 
of a node pair and leads a small (fc). It is important to 
know the relation between and (k) for a finite but large 
system. We calculate the average degree by 



(k) 



kis{x) dx , 



(21) 
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instead of (k) = J kP(k)dk, using the derived expression 
(not asymptotic form) of ki for a finite system. 

Let us consider Eq. (f2"Tj) separately for < Oo and for 
© > O . In the case of < 6o, the length lmi n (xi) can be 
larger than L, which implies QL m /x min < x max . Thus, 
the integral of Eq. (|2Tj) is separated into two regions 



(k) = 



k(x)s(x)dx+(N-l) / s(x)dx, 

J@L m /x min 

where k{x) is given by Eq. (|5J) regarding xi as a continu- 
ous variable x. The coefficient N — 1 in the second term 
comes from the fact that nodes satisfying x% > <dL m /x m in 
connect to all N — 1 nodes. Using Eq. flSJ) and approxi- 
mating JqI^/x dx by J erm ;„ dx, we have 



>&L m /x n 

(k) = Xp 
+Zp 



e 



a-l 



L D-m{ a -X) { 
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where 



X 
Y 
Z 



n(a - 1) 
D - m(a - 1) ' 

d-2m(a-l) 



exp 



(a-l)[d-m(a-l)] 
fim 2 (a-l) 2 



- m(a - l)] 1 



(23) 

(24) 
(25) 
(26) 



It should be again emphasized that these geometrical 
quantities X, Y , and Z resulting from the volume inte- 
gration depend strongly on the boundary conditions and 
are not very meaningful. We can evaluate the asymp- 
totic behavior of (k) for large L by using Eq. (|!23"|) . In 
the case of to < m c o, the first term of Eq. (|23p obviously 
dominates the second term. Then, ignoring unimportant 
geographical coefficient, (k) behaves asymptotically as 



(k) ~ e 1_a (loge + c) 



(27) 



where c is a constant depending on the boundary con- 
dition. On the other hand, a careful treatment is re- 
quired for to > to-co. It seems that the second term of 
Eq. (|23[) dominates the first term for to > to c o in the 
thermodynamic limit. However, we should note that 
must be infinitesimal to satisfy the condition < 0o 
in this calculation because 0o for L — > oo goes to zero 
for to > to c o so it is not obvious which term is dom- 
inating in Eq. (1231) . In order to find the dominant 
term, we evaluate the lower bounds of these terms by 
replacing by ©o and using 1|14[) . Then, the lower 
bounds of the first and the second terms are proportional 

tologlYipn/D) 1 ^ - 1 ^ ^ -^} and^t 1- ^^)] respec- 
tively. This suggests that the second term dominates the 
first term for large L because the exponent 1 — 
positive for to > m C Q. We have then 

(k) ~ 0-°/™ , 



D 



m(a—l) 



IS 



(28) 



for < ©o and to > m C Q. At m = to c o, both terms in 
Eq. (|2"3")l should be considered. 

Next, we treat the case of > ©o- Here l m i n (xi) is 
always less than L, and the degree fcj is given by Eq. © 
for any Xi in the range of x m i n < Xi < a; max ■ The average 
degree (k) is then simply presented by 



a-l 



k(x)s(x) dx 







L D - m ( a -V log (WL D ) 



-pZ[D - m{a - 1)] 



-pZ[D — m(ot — 1) 











D/m 



(WL 



DW(o-i)- 



D/r 



(29) 



where W = pQ/D and we used Eq. {SJ for k(x) and 
Eq. (fP2|) . For to > to c o, it is easy to understand that the 
third term dominates other two terms for large L. So we 
have 



(k) - Q- D/r 



(30) 



In the case of m < to c o, the infinitely large © must be 
considered when we find the dominant term of Eq. (|29p , 
because is larger than ©o and ©o diverges for m < to c o 
in the thermodynamic limit. As in the case of < ©o 
and m > to c0 , replacing © in Eq. (j2T))) by O , the L de- 
pendence of the upper bounds of these terms points to the 
dominant term. Since the upper bounds of the first, sec- 
ond, and third terms are proportional to \og(pVLL D / 'D), 

L°, and L D ^ ' , respectively, the first term domi- 
nates the second and third terms because of to < m C Q. 
Therefore, the average degree (k) is asymptotically given 

by ~ 



(k) ~ 1 



(31) 



This relation differs from Eq. (|27| by a logarithmic cor- 
rection. 

In summary, the relation between and (k) is given 
by 



(k) 



for < ©n and 



e 1_a Cioge + c) 

Q-D/m 



, to < to c o (32a) 
, to > to c o (32b) 



(k) 



i-« 

Q-D/r, 



TO < TO c o 
TO > TO C Q 



(33a) 
(33b) 



for > ©o- At to = to c o, (k) is related to © through 
Eq. ||23J for < ©o and through Eq. @ for > Q , 
because every term contributes equally to (k) even in 
the thermodynamic limit. A similar result to Eq. (|33[) 
has been obtained by [l7T | where the nodes were uni- 
formly distributed in a c?-dimensional space with the 
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L-max norm and to is fixed at to — D(— d). The 
asymptotic L dependence of (k) for a fixed can be 
also evaluated from Eqs. (|23|) and (|29f . In the case of 
to < to c o and enough large L (then, < Go), the domi- 
nant first term of Eq. ([23]) gives (k) cx LD-m(a-i) logL 
For to > TO c o (and then > 0o), we have (k) oc L°. 
These L dependences are consistent with those calcu- 
lated by (k) = ff' 1 kP(k) dk by using Eqs. (H7J) and 
(|9bp for to < to c0 and to > to c0 , respectively, and taking 
into account the L dependence of po in Eq. QI7p . 



We should remark that R(l) for m < m c o goes to infinity 
as I — > oo. Since the length I does not exceed the size L in 
a finite system, the distribution R(l) is actually truncated 
at I = L. In the thermodynamic limit, however, we must 
consider the infinitesimal normalization constant coming 
from the denominator of Eq. (|37[) . 

In the case of I < £ (namely, x min > xi), k(x,l) 
in Eq. (|34j) is given by Eq. (|7b[) for any x in the in- 
tegration range [x m i n ,x max \. Thus, k(l) is given by 
pfll ^ 1 J x s(x)dx, namely, 



EDGE-LENGTH DISTRIBUTION 
FUNCTION 



k(l) = pfll 



D-l 



(39) 



and then, 



In this section, we derive the edge-length distribution 
function R(l) of our geographical networks embedded in 
a fractal space. To this end, we regard ki(l) given by 
Eq. (|7|) as a continuous function k(x,l) of the fitness x 
and the edge length I. The average number of edges, 
k(l)dl, of length [I, I + dl] from a given node is obtained 
by averaging k(x, I) over the fitness x, i.e., 



MO 



s(x)k(x, I) dx. 



(34) 



Equation ([7]) expresses the forms of ki(l) by separating 
two cases I > l m in(xi) and I < l m i a (xi) for a fixed Xi. 
Corresponding to this classification, k(x, I) in Eq. (|3"4")l for 
a fixed I has different forms for x m i n < x < 0/ m /x m ; n and 
0Z m /a; m i n < x < x max , respectively. Thus, the integral 
of Eq. (fMl) is calculated as 



k(i) = P n 





+ P m D - 1 



iD-l-m(a-l) 



s(x) dx , 



s(x)x a 1 dx 
(35) 



where xi = &l m /x m i n . Here, I is assumed to be larger 
than £ (namely, x m i n < x{) and the upper cut-off of the 
integral is, as an approximation, extended to infinity. Us- 
ing Eqs. ©, flU, (H), and t$TE§, k(l) is expressed by 



k(i) = P m 



D-l 



I + m(a 



(36) 



The probability distribution function R(l) is given by 

Hi) 



R(l) 



k(l')dl' 



(37) 



where the integration in the denominator is done over the 
whole range of I. Neglecting the normalization constant, 
the edge-length distribution is 



R(l) oc I 



D-l 



— rn(a— 1) 



I + m(a 



l!lu,(| 



(38) 



R(l) oc I 



D-l 



(40) 



It should be noted that the distribution R(l) depends 
on the fractal dimension D independently of m while 
the degree distribution P{k) does not depend on D for 
to < m c o (weak geographical effect region) . 

Two expressions Eqs. (J3HJ) and flM]) of R{1) for I > £ 
and I < £ must coincide at I = £. This condition con- 
cludes that the proportionality coefficients for Eqs. (|38|) 
and (|4U|) are identical. We can derive the common co- 
efficient C for an infinite system from the normalization 
condition of R(l) given by 



i = c / 4 1 

'a 



D-l 



dl 



d/ , (41) 



where f3 — m(a — 1). When to > to c o, this equation 
provides a finite coefficient expressed by 



c= D (i _m s o 



m 



(42) 



while C is infinitesimal for to < to c o as mentioned above. 
It should be noted that the coefficient C does not de- 
pend on the boundary condition because the boundary- 
condition dependent factors in the numerator and the 
denominator in Eq. (f3"T|) are canceled out. 

From the above argument, we can immediately de- 
rive the probability g(l) of two nodes with distance I to 
be connected. This probability is given by the ratio of 
the number of connected nodes k(l)dl to n(l)dl nodes lo- 
cated at distances [/,/ + dl] from a given node, namely, 
g(l) = k(l)/n(l). Since n(l) = pVll 0-1 as mentioned be- 
low Eq. © and k(l) is given by Eq. (I36[) or (|39j) for / > £ 
or / < £ respectively, we have 



9(1) 




-m(a— 1) 



I + mux - J. i lou, ( ^ 



(43) 
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FIG. 1: Typical Sierpinski geographical network. Nodes are 
located on vertices of the Sierpinski gasket in the 6th genera- 
tion (N = 366). Parameters to form the network are a — 2.0 
and m = 3.0. The threshold <d is chosen to satisfy (k) — 10.0. 



This expression can be alternatively derived directly from 
the meaning of g(l), 



1(1) 



s(x)dx / s(y)dy . (44) 

Jxy/l™>0 



Considering that two nodes are always connected if the 
fitness of one node exceeds xi{~ G^ m /£ m in), we can sep- 
arate the above integration into two parts as 



9(1) 



s(x)dx 



s(x)dx / s{y)dy , (45) 



for x\ > x m - m (I > £). For xi < x m ; n , the integral range 
of the second integral in Eq. (|4"4")l is spread over the whole 
region of y, then g(l) = J°° f°° s(x)s(y)dxdy. These 
equations again lead Eq. (|4"5|) if we use Eq. . We should 
note that the probability g{l) given by Eq. (j4"3")l does not 
depends on the fractal dimension D for any value of m. 



VI. NUMERICAL CONFIRMATIONS 

The above analytical results are numerically verified 
in this section. At first, we confirm the behavior of the 
degree distribution function P(k) for geographical net- 
works on the Sierpinski node set (Sierpinski geographical 
networks). Nodes are located on vertices of the Sierpin- 
ski gasket with the fractal dimension D = log 3 / log 2 sa 
1.585 and connected by edges according to the condition 
Eq. ([T]) with Eqs. ([2]) and In our numerical calcula- 
tions in this work, the lower cut-off x m in is set to be unity. 
A typical network on the 6th generation Sierpinski gas- 
ket (N = 366) is depicted in Fig. 1. There exist nodes 
(hubs) possessing a large number of edges. Numerically 
calculated degree distribution functions P(k) for two net- 
works with different m and a are presented in Fig. 2. The 
scale-free property of networks formed by our algorithm 
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FIG. 2: (Color online) Degree distribution of Sierpinski geo- 
graphical networks in the 9th generation (N = 9,843). The 
threshold O is chosen to satisfy (k) — 120.0 for good statis- 
tics. Parameters a and m are set as (a) a — 2.0 and m = 4.0 
and (b) a — 3.0 and m — 1.0. Thick dashed lines through 
dots indicate the slopes 7 predicted by Eq. JTTJ . Thin dashed 
line in Fig. 2(a) represents the slope calculated by using the 
Euclidean dimension d = 2 instead of the fractal dimension 
D = 1.585. 



is clearly shown in this figure. The scale-free exponents 
7 calculated numerically for many Sierpinski geographi- 
cal networks with different combinations of a and m are 
plotted in Fig. 3 as a function of m(a — 1)/-D- In this 
figure, values of 7 for networks with nodes distributed 
homogeneously in two-dimensional Euclidean space (2d- 
geographical networks) are also plotted. We should re- 
mark that our theoretical arguments are valid for a geo- 
graphical network with homogeneously distributed nodes 
which is a special case with the Euclidean dimension d 




2.0 

m(a-1)/D 

FIG. 3: (Color online) Degree exponent 7 as a function of 
m(a — 1)/D. Solid line shows the theoretical prediction 
Eq. ()lip . Filled circles and squares represent numerically 
obtained 7 for Sierpinski geographical networks (9th genera- 
tion) and 2d-geographical networks (iV = 1,000) with several 
combinations of a and m. Values of 7 are evaluated by the 
least-squares fit. 
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FIG. 4: Relation between the average degree (fc) and the 
threshold value O for (a) m < m c o and O < Oo, (b) m < m c o 
and > 0o, (c) m > m c o and < ©o, and (d) m > m c o and 
> 0o- Dots represent results calculated numerically for 2d- 
geographical networks. Details of parameters and conditions 
are given in the text. Solid curves show the theoretical pre- 
dictions given by Eqs. (|32[l and (|33p . Dashed line in Fig. 4(a) 
indicates the slope given by Eq. (|32a[) without the logarithmic 
term. 



instead of the fractal dimension D. In fact, Eq. (fTTj) with 
D = d reproduces the results by 0, [TtJ • We see that all 
numerical results collapse onto the theoretical line given 
by Eq. (fTTj) . These results strongly support the theoret- 
ical predictions on P{k) presented in Sec. Mil 

Next, we confirm numerically the relation between (k) 
and 6 for 2d-geographical networks. We have to evalu- 
ate &o defined by Eq. (ITU) to distinguish four regions 
with respect to to and © corresponding to Eqs. (j32|) 
and (|33[) . In addition, must satisfy the condition 
©min < B <C max , where e min and max are given 
by Eqs. JTSJ and ([20]l. respectively. Rewriting Eqs. {H} 
and @ by 9„ = ATV^-i^L"™ and 8max = 
N [w+ m ( a -i)]/[D( a -i)] x ^ L - m ^ we can estimate q q an d 

©max without treating the boundary-condition depen- 
dent geometrical factor pfl/D. Figures 4(a) and 4(b) 
show numerically calculated (k) as a function of © for 
to < m c o. In these calculations, the exponent to charac- 
terizing the strength of the geographical effect are cho- 
sen to be 1.0, and N — 1,000 nodes are distributed in 
a square of size L — 100.0. The fitness x is allocated 




FIG. 5: (Color online) Edge-length distribution function R(l) 
for 2d-geographical networks in squares of size L — 10.0. In 
order to obtain the numerical results (dots), we employ a = 
2.0 (m c o = 2.0), m = 3.0, and N = 1,000. The threshold 
value is chosen as = 125.2 so that (k) becomes equal to 
10.0. The results are averaged over 1, 000 realizations. Solid 
and dashed lines represent the theoretical results given by 
Eq. (|38|) and (140 [) with the common prefactor presented by 
Eq. (|42p . The inset shows the same result in a logarithmic 
scale. The length £(= 0.20) defined by Eq. lfT6)) is indicated 
by the arrow. 



to each node according to the distribution function s(x) 
given by Eq. ([3]) with a = 1.5. The square system has 
the periodic boundary conditions in the x and y direc- 
tions. Since m c o = 4.0, ©o = 10 4 , m i n = 0.01, and 
©max = 3.2 x 10 11 from these parameters, the conditions 

©min < © < ©0 < ©max and min < O < © < ©max 

are satisfied for Figs. 4(a) and 4(b), respectively, and 
777 < 777 c o for both figures. Numerically calculated {k)s 
are well described by solid lines representing Eqs. (|32a[) 
and (|33ap , where the constant c in Eq. (|32ap and pref- 
actors are suitably chosen. Results for m > m c o are 
presented in Figs. 4(c) and 4(d) which show the (fc)-0 
relation for < ©o and > 0o, respectively. As in 
the cases of Figs. 4(a) and 4(b), nodes are distributed 
in a square of size L = 100.0. Parameters a and m 
are chosen as a = 2.0 and m = 3.0 so that the con- 
dition m > 777 c o(= 2.0) is satisfied. In order to realize 
the condition min <C 6 < ©o < ©max in Fig. 4(c), 
we treated a large network with N = 100, 000, which 
has min = 10~ 6 , max = 3.2 x 10 11 , and O = 0.1. 
On the contrary, the number of nodes for Fig. 4(d) is 
N = 1, 000, in which min < ©o < © < ©max is satisfied 
with min = 10~ 6 , max = 3.2 x 10 4 , and O = 0.001. 
Similarly to Figs. 4(a) and 4(b), numerical results agree 
well with the theoretical results shown by solid lines. 

Finally, the behavior of the edge-length distribution 
function R(l) is examined also for 2d-geographical net- 
works. Results (dots) shown in Fig. 5 are calculated in 
the condition of to > to c o, for which R(l) can be properly 
normalized even in the thermodynamic limit. Solid and 
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TABLE I: Three phases of geographical networks embedded 
in a fractal space. 



Phase 


Region of m 


(k) 


(0 


Non-compact 


m < m c0 


infinite 


infinite 


Intermediate 


rrico < m < m c i 


finite 


infinite 


Compact 


m > m c i 


finite 


finite 



dashed curves represent the theoretical prediction given 
by Eq. (|38p and P0|) with the common prefactor pre- 
sented by Eq. (14*21 . It should be emphasized that there is 
no fitting parameter to obtain the theoretical curve. Nu- 
merical results agree quite well with the theoretical pre- 
diction. The peak structure and the tail profile slightly 
deviating from a power law (see the inset of Fig. 5) result 
from the logarithmic term of Eq. The reason of the 

slight deviation between numerical results and the theo- 
retical line for I < 0.05 (see the inset) is due to a finite 
A; defined below Eq. (QI]). 



VII. COMPACTNESS AND EFFICIENCY 

From the functional forms of P(k) and R(l), we can im- 
mediately find the convergence property of the average 
degree (k) and the average edge length (I) in the ther- 
modynamic limit. As argued at the end of Sec. IIV1 the 
quantity (k) for m < m c o diverges as £ J D - m ( a - 1 ) \ g j_, 
and it converges for m > m c o- On the other hand, the 
convergence of (l) is governed by Eq. ((55)) describing R(l) 
in the asymptotic I regime. From Eq. (|38[) . the average 
(I) diverges for m < m c \ and remains finite for to > to c i, 
where 



m c i 



D + l 

a — 1 



(46) 



The behavior of (k) and (I) suggests that the impact of 
the geographical effect can be classified into three regions. 
For < m < to c o, both quantities (k) and (I) diverge in 
the thermodynamic limit. Since a node connects with 
a huge number of nodes far away we call this region the 
non-compact phase. In the same sense, the region of m > 
to c1 is termed the compact phase where both quantities 
(k) and (I) remain finite and a node connects with only a 
small number of nodes in its vicinity. In the intermediate 
phase, i.e., m c o < m < m cl , the average degree converges 
but (I) diverges. These regions are summarize in Table [I] 

Let us consider the relation between the compactness 
of a network and the geographical efficiency e defined by 



-T- 

1) 4-f r„ ' 



(47) 



where ry is the shortest Euclidean distance between two 
nodes i and j along a network path. This quantity is a 
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FIG. 6: (Color online) Rescaled geographical and global effi- 
ciencies as a function of m/m c o for Sierpinski (solid lines) and 
2d- (dashed lines) geographical networks. For both network 
systems, we employ a = 2.0 and O giving (k) = 10. 0, and 
all results are averaged over 1,000 realizations. Numbers of 
nodes in Sierpinski geographical networks in the 5th, 6th, and 
7th generations are f23, 336, and f,095, respectively. 



natural extension of the global efficiency E defined by 

2 s 

1 ' J 'Hj 



E = 



N(N - 1) 



-T- 



(48) 



where dij is the shortest network distance between two 
nodes i and j introduced in [30(. Both quantities e and 
E characterize the efficiency of the information exchange 
or the flow in the network. If the efficiency is governed 
mainly by the Euclidean distance along the path rather 
than the number of steps, the geographical efficiency e is 
more meaningful than E, and vice versa. 

We calculated numerically these quantities and ex- 
amined how the compactness is related to the efficien- 
cies e and E. Solid lines in Fig. 6 represent the ef- 
ficiencies for Sierpinski geographical networks in three 
different generations (having the same linear dimen- 
sion L = 1.0), while dashed lines show them for 2d- 
geographical networks with different numbers of nodes 
in squares of size L = 10.0. In order to eliminate the 
L and N dependence of the maximum and minimum 
values of the efficiencies, we plot the rescaled quantities 
e and E defined by e = (e - e min )/(e max - e min ) and 
E= (E-E, 

^max 



mm) and 
re- 



)/(-Emax - Emm), where e min (E, 
(E max ) are e(E) at m/m cQ = 10~ 2 and 10 2 
spectively. The rescaled geographical (global) efficiency 
e (E) rapidly increases (decreases) near m = m c o as in- 
creasing to. From the fact that enlarging the system the 
slope near to = m c o becomes steeper and the point at 
the intersection of e and E approaches m = m c o, the 
efficiencies e and E in the thermodynamic limit are sup- 
posed to show step-like forms at m = rn C Q. This behavior 
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FIG. 7: 2d X-ray CT image of a soil structure with network 
edges connecting pores (black patches) according to Eq. ([T]) 
with m = 5 and 6 = 0.01. From \M- 




Pore size x (pixel ) 

FIG. 8: (Color online) Pore-size distribution obtained from 
the image analysis of the soil structure shown in Fig. 7. 
Dashed line shows the power-law fit s(x) ~ x~ a to the distri- 
bution with a = 1.6. 



can be interpreted as follows. In the non-compact phase 
(to < to c o), existing short-cut edge (in the topological 
sense) connects a starting node and a target node by a 
small number of edges with going back-and-forth around 
the target node. This back-and-forth motion, however, 
requires an extra Euclidean distance and leads the low ge- 
ographical efficiency. On the other hand, in the compact 
phase, the network structure resembles the structure of 
a regular lattice (or regular Sierpinski gasket). Although 
we need lots of edges to connect two distant nodes, the to- 
tal Euclidean length along the path can be minimized as 
the geodetic distance between two nodes. Thus, the ge- 
ographical efficiency e becomes large in this region. The 
above consideration supports that the global efficiency E 
measuring the number of edges to connect nodes is large 
for m < to c0 and small for m > m c0 . It should be noted 
that the abrupt change in e or E occurs at to = m c o 
but not at m — m c i though the transition point m c i is 
related to the edge length. Since e and E behave op- 
positely, it is crucial to clarify which efficiency is more 
relevant to a given problem by considering how strongly 
the cost of the flow is influenced by the Euclidean dis- 
tance. It is also interesting that both efficiencies e and E 
are relatively high in the network at to = to c o at which 
the competition between order and disorder in the geo- 
metrical sense is balanced. 



VIII. EXAMPLE 

We will demonstrate the above described approach us- 
ing soil-pore networks as an example of a real- world spa- 
tially embedded network. Two approaches to build com- 
plex network models of soil-pore organization have re- 
cently been developed [3ll - l33[. H ere, we will concentrate 
on the networks presented in [3l[ formed by geographical 
threshold algorithm as described in Sec. [II] We consider 



a set of N pores representing the nodes of the network. 
The nodes of the network are the centers of the pores and 
the links between nodes are drawn according to Eq. ([1]) 
with Eqs. © and (J3|), where in this case the continu- 
ous fitness variable x is the size of the pore. Pore sizes 
and their relative positions of actual soil specimens are 
obtained from the image analysis of 2d soil X-CT scans 

a 

An example of the soil-pore network overlain on the 2d 
soil porous structure image is presented in Fig. 7 (from 
3 If ) . The pore-size distribution of the soil sample shown 
in Fig. 7 is analyzed to be of the form s(x) ~ x~ a with 
a = 1.6 (Fig. 8). The fractal dimension D = 1.32 of the 
soil-pore structure is also determined using box-counting 
method as shown in Fig. 9. Our theory predicts that 
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FIG. 9: (Color online) Number of boxes required to cover 
the soil-pore image shown in Fig. 7 as a function of the box 
size. The slope obtained by the least-squares fit (dashed line) 
indicates that the fractal dimension of the soil-pore structure 
is 1.32. 
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FIG. 10: (Color online) Degree distributions for three soil- 
pore networks formed by (a) m = 1, (b)m = 3, and (c) m = 5. 
Dashed lines through dots represent the power-law fits with 
the slopes indicated beside the lines. 7 t h in each panel gives 
the degree exponent predicted by Eq. (|11[) with D — 1.32 and 
a = 1.6. 



fractality of pore spatial distribution is reflected in the 
network organization. By measuring the scaling expo- 
nent 7 of the degree distribution for any to > m C Q and 
the exponent a of the pore-size distribution, the fractal 
dimension D can be determined. 

IX. CONCLUSIONS 

We have proposed a geographical scale-free network 
model with the nodes embedded in a fractal space and an- 
alytically and numerically studied several network prop- 
erties. The fractal dimension D of the embedding space 
was found to influence the scale-free exponent as 7 = 
to (a — 1)/D + 1 only if the spatial embedding is strong 
enough (i.e., when to > m c o) otherwise 7 = 2. The 
analyses of the average degree and average edge length 
revealed that this type of network can exist either in the 
non-compact, compact, or intermediate phase depending 
on the importance of the spatial arrangement of nodes. 
We derived the edge-length distribution functions for our 
network model and showed that it has a peak-like struc- 
ture similar to the profile of the shortest-path-distance 
distribution observed in a large-scale structure of the In- 
ternet [13, S| ■ It is interesting to apply our approach to 
modeling the Internet at the autonomous systems level 
considering the observed long-tailed distribution of au- 
tonomous systems sizes [35]. The measured degree dis- 
tribution exponent of the Internet is slightly larger that 
7 = 2 (7 = 2.1 ~ 2.2) [H,[25[ and seems to be decreasing 
with time (25J. In our network model this would be an 
indication of the evolution of the Internet towards the 
non-compact phase (to — > m c rj). 

We hope that our work will help in advancing the un- 
derstanding of the complex systems in which the hetero- 
geneity of intrinsic properties and the spatial arrange- 
ment of the elements play an important role. 



the soil-pore network has a power-law degree distribution 
function and the degree exponent 7 depends on m above 
to c o = 2.2. Figure 10 shows the degree distributions for 
three networks based on soil image data constructed with 
different values of the parameter to (to — 1, 3, and 5). 
In this figure, the power-law fits (dashed lines) and cal- 
culated exponents of the degree distributions are shown 
together with theoretically predicted scale- free exponents 
from Eq. (fTTj) . The results clearly demonstrate the agree- 
ment of the empirically obtained scale- free exponents and 
the theoretically predicted ones. This also shows how the 
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